Learning classifier systems for decision making in continuous-valued domains

نویسنده

  • Chritopher Stone
چکیده

This thesis investigates Learning Classifier System architectures for decision making in continuous-valued domains. The information contained in continuous-valued domains is not always conveniently expressed using the ternary representation typically used by Learning Classifier Systems and an interval-based representation is a natural choice. Two intervalbased representations recently proposed are analysed, together with their associated operators. Evidence of considerable representational and operator bias is found. A new interval-based representation is proposed that is more straightforward than the previous ones and its bias is analysed. Learning Classifier Systems are compared for online environments that consist of real-valued states and which require every action made by the agent to count towards its performance. Two Learning Classifier System architectures are considered, XCS and ZCS. An interval representation is used for the rule conditions and a roulette wheel is used for action selection. The performance of these two Learning Classifier System architectures is investigated on a set of abstract environments with both deterministic and stochastic reward functions. Although XCS clearly delivers superior performance in the deterministic environments tested, the simple ZCS architecture is found to be robust and able to equal or exceed the performance of XCS in the stochastic environments tested, especially those with more demanding characteristics. Aspects of the algorithm and parameter set of ZCS are studied on problems with real-valued states and a Boolean action space. Increased performance is found to result from the use of an update algorithm based on that of NewBoole, an earlier strength-based Learning Classifier System. A new operator, specialize, is introduced and found to be effective in combatting over general classifiers. The modified algorithm and parameter set is tested on several variants of three real-valued test problems. The resulting Learning Classifier System is applied to simulated Foreign Exchange trading using an experimental setup and data previously presented in the literature. Results show that a simple Learning Classifier System is able to achieve a positive excess return in simulated trading.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Generalized interval-valued intuitionistic fuzzy Hamacher generalized Shapley Choquet integral operators for multicriteria decision making

The interval-valued intuitionistic fuzzy set (IVIFS) which is an extension of the Atanassov’s intuitionistic fuzzy set is a powerful tool for modeling real life decision making problems. In this paper, we propose the emph{generalized interval-valued intuitionistic fuzzy Hamacher generalized Shapley Choquet integral} (GIVIFHGSCI) and the emph{interval-valued intuitionistic fuzzy Hamacher general...

متن کامل

INCOMPLETE INTERVAL-VALUED HESITANT FUZZY PREFERENCE RELATIONS IN DECISION MAKING

In this article, we propose a method to deal with incomplete interval-valuedhesitant fuzzy preference relations. For this purpose, an additivetransitivity inspired technique for interval-valued hesitant fuzzypreference relations is formulated which assists in estimating missingpreferences. First of all, we introduce a condition for decision makersproviding incomplete information. Decision maker...

متن کامل

Arithmetic Aggregation Operators for Interval-valued Intuitionistic Linguistic Variables and Application to Multi-attribute Group Decision Making

The intuitionistic linguistic set (ILS) is an extension of linguisitc variable. To overcome the drawback of using single real number to represent membership degree and non-membership degree for ILS, the concept of interval-valued intuitionistic linguistic set (IVILS) is introduced through representing the membership degree and non-membership degree with intervals for ILS in this paper. The oper...

متن کامل

Code-Specific Learning Rules Improve Action Selection by Populations of Spiking Neurons

Population coding is widely regarded as a key mechanism for achieving reliable behavioral decisions. We previously introduced reinforcement learning for population-based decision making by spiking neurons. Here we generalize population reinforcement learning to spike-based plasticity rules that take account of the postsynaptic neural code. We consider spike/no-spike, spike count and spike laten...

متن کامل

UNCERTAINTY DATA CREATING INTERVAL-VALUED FUZZY RELATION IN DECISION MAKING MODEL WITH GENERAL PREFERENCE STRUCTURE

The paper introduces a new approach to preference structure, where from a weak preference relation derive the following relations:strict preference, indifference and incomparability, which by aggregations and negations are created and examined. We decomposing a preference relation into a strict preference, anindifference, and an incomparability relation.This approach allows one to quantify diff...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005